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NUCLEIC ACID DIAGNOSTICS BASED ON MASS SPECTROMETRY OR MASS SEPARATION AND BASE SPECIFIC 
CLEAVAGE 

FIELD OF THE INVENTION 

5 The present invention relates generally to a method for detecting a mutation in a nucleic acid 
molecule. The method of the present invention does not require prior knowledge of a reference 
or wild-type nucleotide sequence nor does it require a gel electrophoresis step. The method of 
the present invention is particularly useful in identifying mutations and polymorphisms in 
genomic DNA and more particularly in the human genome and to determine and/or confirm the 
10 nucleotide sequence of target nucleic acid molecules. The method of the present invention may 
also be automated. 

BACKGROUND OF THE INVENTION 

15 Bibliographic details of the publications referred to by author in this specification are collected 
at the end of the description. 

The increasing sophistication of recombinant DNA technology is greatly facilitating research and 
development in a range of biotechnological fields. A particularly important area is the generation 
20 of nucleotide mutants and the screening for and identification of such mutants. This in turn has 
implications, for example, in understanding the genetic basis behind certain disease conditions 
which is becoming of increasing relevance as the human genome is progressively sequenced. 

An efficient and accurate method of mutation detection is crucial in implicating disease candidate 
25 genes and in the screening programs which follow identification of disease causing mutations. 
Many human inherited and sporadic disorders are caused by small mutations including base 
substitutions, additions and deletions. Among these disorders are the Mendelian single gene 
disorders, sporadic somatic mutations causing cancers and complex genetic traits. Whilst some 
diseases are caused by a limited and well characterised set of mutations, most genetic diseases 
30 are caused by one or more of a large range of mutations occurring anywhere within the gene. 
It is important, therefore, that a mutation detection protocol be able to scan a region of DNA, 
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identify any change and describe the resulting nucleotide differences from wild-type. With the 
increasing use of population molecular genetics and as clinicians begin to use mutation analysis 
as a clinical tool, there is a need to develop mutation detection protocols which can be 
automated, are less dependant on user expertise and are more accurate and reliable. 

5 

Current mutation detection protocols require either a gel based detection system or sequence 
specific primers. Gel based detection methods include direct sequencing of amplified DNA 
fragments and various techniques involving either cleavage of mismatched bases in 
heteroduplexes or mobility differences of single or partially denatured DNA strands. 

10 

Detection of mutations by DNA sequencing can provide good results in relation to accuracy and 
information about the position and nature of the mutation (Hattori et al, 1993), however, 
although advances have been made in this area, the technique is not fully automated and is labour 
intensive. Most mutations occur as heterozygotes and there are technical difficulties with the 
15 ability of currently available computer software to identify two different nucleotide bases at a 
mutated residue. 

Many mutation detection techniques exploit differential electrophoretic mobilities of DNA 
fragments with sequence differences. Single strand conformation polymorphism (SSCP) exploits 

20 the fact that the secondary structure of a single strand of DNA is sequence based and, therefore, 
strands with even just one base difference will migrate at a different rate (Orita et al t 1989). This 
technique is again gel based and can lack sensitivity. Furthermore, the method cannot be readily 
automated and requires a large amount of labour due to the necessary gel step which in most 
cases must be optimised to the specific sample being analysed. They also do not give any 

25 information about the position or nature of the change and do not routinely identify all mutations. 

Mutation detection based on the identification of base pair mismatches in heteroduplex DNA 
strands is another method of identifying point changes. There are a number of techniques 
available that cleave DNA at mismatched base pairs in heteroduplex DNA. Mismatch cleavage 
30 protocols include chemical and enzymatic mismatch cleavage. The techniques are also gel based. 
The chemical cleavage method uses osmium tetroxide to cleave at the mismatched base (Cotton 
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et al, 1988) followed by separation of cleaved products on denaturing gels. A major 
disadvantage of the chemical cleavage protocol is the use of extremely toxic chemicals. 

Other methods for detection of known mutations include minisequencing allele specific 
5 polymerase chain reaction (PCR), oligonucleotide probe arrays (Lipshutz et al, 1995) which 
requires knowledge of the sequence of wild-type and mutant. Although this technique is suitable 
for non-gel based detection methods, it is only useful for know mutations. Furthermore, the 
large number of oligonucleotides required to cover all known mutations in many genes makes 
this approach prohibitively expensive and labour intensive. 

10 

With the development of the matrix assisted laser desorption ionisation - time of flight mass 
spectrometer (MALDI-TOF MS), the ability to accurately determine the mass of biomolecules 
of a limited size has been achieved. Although detection of DNA fragments of up to 622 base 
pairs in length has been reported, large fragments cannot be accurately sized and a mass accuracy 
15 of ±3bp is quoted (Liu et al f 1995). This level of accuracy is clearly insufficient for the detection 
and characterisation of base substitutions. 

There is a need, therefore, to develop an effective and accurate means of detecting mutations in 
nucleic acid molecules. Preferably, the mutation detection system would be automatable. 

20 

In work leading up to the present invention the inventors developed a mutation detection system 
which exploits the accuracy of mass determination of MALDI-TOF MS and which is applicable 
for large DNA fragments. The method of the present invention do not require gel 
electrophoresis nor is prior knowledge of the nucleotide sequence necessary. The method of the 
25 present invention is also capable of being automated. 
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SUMMARY OF THE INVENTION 

Sequence Identity Numbers (SEQ ID NOs.) for the nucleotide and amino acid sequences referred 
to in the specification are defined following the bibliography. 

5 

Throughout this specification and the claims which follow, unless the context requires otherwise, 
the word •■comprise", or variations such as "comprises" or "comprising", will be understood to 
imply the inclusion of a stated integer or group of integers but not the exclusion of any other 
integer or group of integers. 

10 

One aspect of the present invention contemplates a method of detecting a difference of one or 
more nucleotides between a nucleic acid molecule to be tested and a reference nucleic acid 
molecule, said method comprising subjecting the test nucleic acid molecule to base specific 
cleavage to generate oligonucleotide fragments, separating the resulting oligonucleotide 
15 fragments based on mass by MALDI-TOF MS and/or other equivalent procedure to produce a 
fingerprint of the oligonucleotide fragments comprising one or more peaks wherein a peak 
represents the mass of each fragment and identifying an altered peak relative to a reference 
nucleic acid molecule subjected to the same procedure wherein the presence of an altered peak 
is indicative of a difference of one or more nucleotides in said tested nucleic acid molecule. 

20 

Another aspect of the present invention provides a method of detecting a difference of one or 
more nucleotides between a nucleic acid molecule to be tested and a reference nucleic acid 
molecule, said method comprising amplifying said test nucleic acid molecule by polymerase chain 
reaction (PCR), subjecting the test amplified nucleic acid molecule to base specific cleavage to 

25 generate oligonucleotide fragments, separating the resulting oligonucleotide fragments based on 
mass by MALDI-TOF MS and/or other equivalent procedure to produce a fingerprint of the 
oligonucleotide fragments comprising one or more peaks wherein a peak represents the mass of 
each fragment and identifying an altered peak relative to a reference nucleic acid molecule 
subjected to the same procedure wherein the presence of an altered peak is indicative of a 

30 difference of one or more nucleotides in said tested nucleic acid molecule. 
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Yet another aspect of the present invention is directed to a method of detecting a difference of 
one or more nucleotides between a nucleic acid molecule to be tested and a reference nucleic 
acid molecule, said method comprising amplifying said test nucleic acid molecule by PCR, 
subjecting the test amplified nucleic acid molecule to base specific cleavage to generate 
5 oligonucleotide fragments of from about 2 to about 1000 bases, separating the resulting 
oligonucleotide fragments based on mass by MALDI-TOF MS and/or other equivalent procedure 
to produce a fingerprint of the oligonucleotide fragments comprising one or more peaks wherein 
a peak represents the mass of each fragment and identifying an altered peak relative to a 
reference nucleic acid molecule subjected to the same procedure wherein the presence of an 
10 altered peak is indicative of a difference of one or more nucleotides in said tested nucleic acid 
molecule. 

Still yet another aspect of the present invention relates to a method of detecting a difference of 
one or more nucleotides between a nucleic acid molecule to be tested and a reference nucleic 

15 acid molecule, said method comprising amplifying said test nucleic acid molecule and 
incorporating uracil residues, subjecting the test amplified nucleic acid molecule to uracil specific 
cleavage mediated by a uracil-N-glycosylase to generate oligonucleotide fragments of from about 
2 to about 1000 bases, separating the resulting oligonucleotide fragments based on mass by 
MALDI-TOF MS and/or other equivalent procedure to produce a fingerprint of the 

20 oligonucleotide fragments comprising one or more peaks wherein a peak represents the mass of 
each fragment and identifying an altered peak relative to a reference nucleic acid molecule 
subjected to the same procedure wherein the presence of an altered peak is indicative of a 
difference of one or more nucleotides in said tested nucleic acid molecule. 

25 Another aspect of the present invention contemplates a computer programme capable of 
controlling a method of detecting a difference of one or more nucleotides between a nucleic acid 
molecule to be tested and a reference nucleic acid molecule, said method comprising subjecting 
the test nucleic acid molecule to base specific cleavage to generate oligonucleotide fragments, 
separating the resulting oligonucleotide fragments based on mass by MALDI-TOF MS and/or 

30 other equivalent procedure to produce a fingerprint of the oligonucleotide fragments comprising 
one or more peaks wherein a peak represents the mass of each fragment and identifying an 
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altered peak relative to a reference nucleic acid molecule subjected to the same procedure 
wherein the presence of an altered peak is indicative of a difference of one or more nucleotides 
in said tested nucleic acid molecule. 

5 Yet another aspect of the present invention is directed to an apparatus capable of detecting a 
difference of one or more nucleotides between a nucleic acid molecule to be tested and a 
reference nucleic acid molecule, said apparatus comprising means of subjecting the test nucleic 
. acid molecule to base specific cleavage to generate oligonucleotide fragments, separating the 
resulting oligonucleotide fragments based on mass by MALDI-TOF MS and/or other equivalent 
10 procedure to produce a fingerprint of the oligonucleotide fragments comprising one or more 
peaks wherein a peak represents the mass of each fragment and identifying an altered peak 
relative to a reference nucleic acid molecule subjected to the same procedure wherein the 
presence of an altered peak is indicative of a difference of one or more nucleotides in said tested 
nucleic acid molecule. 

15 

Still another aspect of the present invention provides a method of detecting a difference of one 
or more nucleotides between a nucleic acid molecule to be tested and a reference nucleic acid 
molecule, said method comprising subjecting the test nucleic acid molecule to base specific 
cleavage to generate oligonucleotide fragments, separating the resulting oligonucleotide 

20 fragments based on mass by MALDI-TOF MS and/or other equivalent procedure and subjecting 
said separated fragments to further separation means, such as post source decay (PSD) or other 
similar technique, to separate fragmentation products to generate a spectrum dependent on 
nucleotide sequence and then identifying an altered peak relative to a reference nucleic acid 
molecule subjected to the same procedure wherein the presence of an altered peak is indicative 

25 of a difference of one or more nucleotides in said tested nucleic acid molecule. 
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BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 is a graphical representation showing mass spectrogram of cleavage products of two 
oligonucleotides, 1 and 2, which differ at two nucleotides, one produces a fragment with a 
5 different nucleotide composition and the other introducing a new cleavage site. The two line 
thicknesses represent the overlaid tracings of the two different oligonucleotides. 1636.3 
represents a thick line peak and 3 1 90.9 represents a thin line peak. 1 8 1 1 . 1 is a thin line peak and 
1828.2 is a thick line peak. Kratos Kompact MALDI 4v5L2; % int. 100% = 24mV (thin); 
81mV (thick). 

10 

Figure 2 is a graphical representation showing mass spectrogram of reacted, separated products 
of normal TUB which represents a homozygote. Mode: linear; Accelerating Voltage: 20,000; 
Grid Voltage: 92.000%; Guide Wire Voltage 0-100%; Delay 12SON; Laser: 1800; Scans 
Averaged: 128; Pressure: 9.94e-07; Low Mass Gate: 900.0; Negative Ions: ON. 

15 

Figure 3 is graphical representation showing mass spectrogram of reacted, separated products 
of both TUB-M and TUB which represents a heterozygote. Mode: linear; Accelerating Voltage: 
20,000; Grid Voltage: 92.000%; Guide Wire Voltage 0-100%; Delay 1250N; Laser:1800; Scans 
Averaged: 128; Pressure: 1.89e-06; Low Mass Gate 900.0; Negative Ions: ON, 

20 

Figure 4 is a representation of the nucleotide sequence of EL- 12 untranslated region PCR 
product used in Example 13. Primers are shown in bold. Expected cleavage products >2bp are 
underlined. The polymorphism is at position 97 and is indicated by asterisk. The polymorphism 
is a C to T change which results in a change of the cleavage products at that position from CGA 
25 to AGA in the forward strand and CAAGC to CAA in the reverse stand. The presence of C at 
position 97 results in a TaqI site and this allele is called "+ n , the other allele is respectively 

Figure 5A is a photographic representation of a TaqI restriction digest of DL-12 PCR products 
from +/- individuals (lanes 1, 4 and 5), a +/+ individual (lane 3) and a -/- individual (lane 2). The 
30 124 bp fragment is cleaved by TaqI (where possible) to produce 97 and 27 bp fragments. 
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Figure SB is a graphical representation showing linear MALDI-TOF spectra of cleavage 
products. The spectra on the left show a mass range of 1000 to 3500 and those on the right are 
the same spectra but show in detail the mass range from 1000 to 1700. Spectra i a and b are 
from a -/- individual, spectra ii a and b are from a +/+ individual and spectra Hi a and b are from 
5 a +/- individual. Observed masses are indicated above peaks. Arrows show the peaks that 
change between the two alleles. 

Figure 6 is a graphical representation of the mass spectrum analysed using post source decay 
(PSD) on a MALDI-TOF instrument. Spectrum A is a 6mer of sequence CATCCT [SEQ ID 
10 NO: 16] and spectrum B a 6mer of sequence CACCTT [SEQ ID NO: 17]. Both have parent ion 
mass of 1727.2Da. Observed masses are shown above the peaks. PSD fragments are shown at 
an intensity magnification of five. 
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DETABLED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

The present invention is predicated in part on a base specific cleavage reaction to generate a set 
of small oligonucleotides bounded by the base cleaved. The nucleic acid molecule may be 
5 completely or only partially cleaved or digested. These fragments are then separated based on 
mass by MALDI-TOF MS. This generates a fingerprint of the nucleic acid fragment comprising 
a series of peaks where each peak represents the mass of each small cleavage product. As a 
result of the sensitivity of mass determination, each oligonucleotide of given length but different 
nucleotide composition produces a different mass. The mass of each peak, therefore, 
10 corresponds to the nucleotide composition of the fragment as well as to its length. 
Consequently, any nucleotide substitution results in either a shifted peak due to the mass 
difference in the new cleavage fragment or, if the mutation changes the targeted base, a cleavage 
product containing a different number of bases. 

15 Accordingly, one aspect of the present invention contemplates a method of detecting a difference 
of one or more nucleotides between a nucleic acid molecule to be tested and a reference nucleic 
acid molecule, said method comprising subjecting the test nucleic acid molecule to base specific 
cleavage to generate oligonucleotide fragments, separating the resulting oligonucleotide 
fragments based on mass by MALDI-TOF MS and/or other equivalent procedure to produce a 

20 fingerprint of the oligonucleotide fragments comprising one or more peaks wherein a peak 
represents the mass of each fragment and identifying an altered peak relative to a reference 
nucleic acid molecule subjected to the same procedure wherein the presence of an altered peak 
is indicative of a difference of one or more nucleotides in said tested nucleic acid molecule. 

25 Conveniently, screening is carried out by comparing the cleavage product masses of the reference 
or wild-type nucleic acid to those of the test sample. Mass changes corresponding to base 
changes are readily observed. 

Accurate mass determination of these small fragments is possible allowing unambiguous 
30 assignation of base composition of each oligonucleotide. This knowledge allows deduction of 
the nature of the mutation and, after specific cleavage at different bases and integration of the 
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data, the position of the mutation. 

The method of the present invention is applicable to any nucleic acid molecule such as but not 
limited to DNA, genomic DNA, cDNA, plasmid DNA, satalite DNA, mRNA and other RNA 
5 molecules as well as DNArDNA, DNA:RNA and RNA:RNA hybrids. The present invention is 
particularly applicable to nucleic acid molecules amplified by, for example, polymerase chain 
reaction (PCR). 

According to this aspect of the present invention, there is provided a method of detecting a 
10 difference of one or more nucleotides between a nucleic acid molecule to be tested and a 
reference nucleic acid molecule, said method comprising amplifying said test nucleic acid 
molecule by polymerase chain reaction (PCR), subjecting the test amplified nucleic acid molecule 
to base specific cleavage to generate oligonucleotide fragments, separating the resulting 
oligonucleotide fragments based on mass by MALDI-TOF MS and/or other equivalent procedure 
15 to produce a fingerprint of the oligonucleotide fragments comprising one or more peaks wherein 
a peak represents the mass of each fragment and identifying an altered peak relative to a 
reference nucleic acid molecule subjected to the same procedure wherein the presence of an 
altered peak is indicative of a difference of one or more nucleotides in said tested nucleic acid 
molecule. 

20 

A particularly preferred requirement is that the source of nucleic acid is cleavable to 
oligonucleotide fragments of from 2 bases to 1000 bases, preferably of from 3 bases to 500 
bases, more preferably of from 4 bases to 100 bases and even more preferably of from 4 bases 
to 50 bases. Oligonucleotide fragments of form 4 bases to 40 bases are of particular usefulness 
25 in practising the present invention. 

Accordingly, the present invention is directed to a method of detecting a difference of one or 
more nucleotides between a nucleic acid molecule to be tested and a reference nucleic acid 
molecule, said method comprising amplifying said test nucleic acid molecule by PCR, subjecting 
30 the test amplified nucleic acid molecule to base specific cleavage to generate oligonucleotide 
fragments of from about 2 to about 1000 bases, separating the resulting oligonucleotide 



WO 98/54571 



PCT/AU98/00396 



- 11 - 

fragments based on mass by MALDI-TOF MS and/or other equivalent procedure to produce a 
fingerprint of the oligonucleotide fragments comprising one or more peaks wherein a peak 
represents the mass of each fragment and identifying an altered peak relative to a reference 
nucleic acid molecule subjected to the same procedure wherein the presence of an altered peak 
5 is indicative of a difference of one or more nucleotides in said tested nucleic acid molecule. 

The nucleic acid may be cleaved by a range of chemical molecules including enzymes. Enzymes 
are particularly preferred due to their specificity. One useful enzyme is uracil-N-glycosylase 
which cleaves DNA at uracil residues incorporated, for example, during a PCR. However, a 
10 range of enzymes may be employed. 

According to this embodiment, the present invention relates to a method of detecting a difference 
of one or more nucleotides between a nucleic acid molecule to be tested and a reference nucleic 
acid molecule, said method comprising amplifying said test nucleic acid molecule and 

15 incorporating uracil residues, subjecting the test amplified nucleic acid molecule to uracil specific 
cleavage mediated by a uracil-N-glycosylase to generate oligonucleotide fragments of from about 
2 to about 1000 bases, separating the resulting oligonucleotide fragments based on mass by 
MALDI-TOF MS and/or other equivalent procedure to produce a fingerprint of the 
oligonucleotide fragments comprising one or more peaks wherein a peak represents the mass of 

20 each fragment and identifying an altered peak relative to a reference nucleic acid molecule 
subjected to the same procedure wherein the presence of an altered peak is indicative of a 
difference of one or more nucleotides in said tested nucleic acid molecule. 

The method of the present invention is predicated in part on the fact that any oligonucleotide 
25 fragment differing in nucleotide composition between mutant and wild-type (or reference) 
sequences will be detected. The method has advantages over previously employed techniques 
and such advantages include the absence of a gel electrophoresis step thereby reducing time, 
expertise and need for separation equipment and the lack of dependance on toxic chemicals, such 
as osmium tetroxide. Whilst the present invention extends to the use of such chemicals in base 
30 specific cleavage reactions, it is preferred to use an enzymatic reaction to cleavage the target 
nucleic acid molecule. 
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The method of the present invention is particularly useful in detecting previously unknown 
mutations. This is important as a screening mechanism for inherited diseases and cancers such 
as during pre-natal diagnosis, diagnosis of a suspected disease or screening for carriers of disease 
alleles. It also has applications in polymorphism analysis of populations and in studies of 
5 evolution, drug resistance, virulence or attenuation of disease agents such as bacteria, viruses or 
protozoa. 

The method may be carried out simultaneously or sequentially with an analysis of a reference to 
wild-type nucleic acid molecule. Both the test and reference nucleic acid molecules can then be 
10 compared. Alternatively, the wild-type nucleic acid molecule may already have been analysed. 
Conveniently, this information may be stored electronically and upon completion of the analysis 
of the test nucleic acid molecule, both the test and reference sequences may then be compared 
manually, electronically or by a computer assisted means. 

15 The method of the present invention may also be used to determine the nucleotide sequence of 
a nucleic acid molecule. 

The nucleotide sequence may be completely determined or a partial sequence obtained for 
example, for selected nucleotides. The method of the present invention, therefore, permits the 
20 rapid determination of a nucleotide sequence which will be invaluable, for example, in the 
efficient analysis of mutations. 

The method of the present invention may be semi or fully automated and the present invention 
extends to apparatuses for automating the mutation detection assay. The apparatus may also be 
25 electronically controlled by a computer programme to facilitate the automation and/or analysis 
process. 

Accordingly, another aspect of the present invention contemplates a computer programme 
capable of controlling a method of detecting a difference of one or more nucleotides between a 
30 nucleic acid molecule to be tested and a reference nucleic acid molecule, said method comprising 
subjecting the test nucleic acid molecule to base specific cleavage to generate oligonucleotide 
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fragments, separating the resulting oligonucleotide fragments based on mass by MALDI-TOF 
MS or other equivalent procedure to produce a fingerprint of the oligonucleotide fragments 
comprising one or more peaks wherein a peak represents the mass of each fragment and 
identifying an altered peak relative to a reference nucleic acid molecule subjected to the same 
5 procedure wherein the presence of an altered peak is indicative of a difference of one or more 
nucleotides in said tested nucleic acid molecule. 

Yet another aspect of the present invention is directed to an apparatus capable of detecting a 
difference of one or more nucleotides between a nucleic acid molecule to be tested and a 

10 reference nucleic acid molecule, said apparatus comprising means of subjecting the test nucleic 
acid molecule to base specific cleavage to generate oligonucleotide fragments, separating the 
resulting oligonucleotide fragments based on mass by MALDI-TOF MS or other equivalent 
procedure to produce a fingerprint of the oligonucleotide fragments comprising one or more 
peaks wherein a peak represents the mass of each fragment and identifying an altered peak 

15 relative to a reference nucleic acid molecule subjected to the same procedure wherein the 
presence of an altered peak is indicative of a difference of one or more nucleotides in said tested 
nucleic acid molecule. 

In a particularly preferred embodiment, the method of apparatus of the present invention also 
20 employs a further fragment separation means such as but not limited to post source decay (PSD). 
PSD, for example, uses the dissociation of highly energised ions during their flight to the detector 
creating a second dimension. The ions are directed into an electric field of opposite polarity and 
are reflected. Smaller ions are reflected earlier and reach the detector first. As the spectrum 
from the decay is dependent on the nucleotide sequence of an oligonucleotide rather than the 
25 nucleotide composition, this avoids missing mutations in an oligonucleotide having the same 
nucleotide composition as a reference oligonucleotide. Although PSD is one convenient 
fragment separation means, the present invention extends to other similar techniques to separate 
fragmentation products. Generally these techniques are based on mass although may also be 
based on electrophoretic mobility, base size, base charge, base paring or other suitable criteria. 

30 

Accordingly, another aspect of the present invention provides a method of detecting a difference 
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of one or more nucleotides between a nucleic acid molecule to be tested and a reference nucleic 
acid molecule, said method comprising subjecting the test nucleic acid molecule to base specific 
cleavage to generate oligonucleotide fragments, separating the resulting oligonucleotide 
fragments based on mass by MALDI-TOF MS and/or other equivalent procedure and subjecting 
5 said separated fragments to further separation means to generate a spectrum dependent on 
nucleotide sequence and then identifying an altered peak relative to a reference nucleic acid 
molecule subjected to the same procedure wherein the presence of an altered peak is indicative 
of a difference of one or more nucleotides in said tested nucleic acid molecule. 

10 The MALDI-TOF MS analysis and further separation means may be done sequentially or 
simultaneously. 

Preferably, the further separation means includes or comprises PSD or other similar techniques 
to separate fragmentation products. 

15 

The present invention is particularly useful in identifying and/or locating mutants in 
heterozygotes. Mutations are detectable on both strains or on one strand only. 

Yet another aspect of the present invention provides a method for identifying and/or locating a 
20 mutation in one or more bases in a target nucleic acid molecule, subjecting the test nucleic acid 
molecule to base specific cleavage to generate oligonucleotide fragments, separating the resulting 
oligonucleotide fragments based on mass by MALDI-TOF MS and/or other equivalent procedure 
to produce a fingerprint of the oligonucleotide fragments comprising one or more peaks wherein 
a peak represents the mass of each fragment and identifying an altered peak relative to a 
25 reference nucleic acid molecule subjected to the same procedure wherein the presence of an 
altered peak is indicative of a difference of one or more nucleotides in said tested nucleic acid 
molecule. 

Preferably, the separated fragments are subjected to further separation means such as but not 
30 limited to PSD. 
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The present invention is further described by the following non-limiting Examples. 
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EXAMPLE 1 
OLIGONUCLEOTIDES 

Two test 22mers oligonucleotides with two bases different were used in this study 

5 

CCT CAT UTT TTU TTG TAA GAG G [SEQ ID NO: 1] 
CCT CGT UTT TTU TTG TUA GAG G [SEQ ID NO:2] 

The different bases are shown in bold. 

10 

For the detection of point mutations (see Example 7), the following oligonucleotides are used: 
TUB: 

GGT GAC CTG AAC CAC CTC GTG CGT CCA GCC GTT CGT GGC TGT CCA GTC CGC 
15 GAAC TCT GAC CTG CGC AAG [SEQ ID NO:3] 
TUB-M: 

GGT GAC CTG AAC CAC CTC GTG CGT CCA GCC GTT CGA GGC TGT CGA GTC 

CGCGAA CTC TGA CCT GCG CAA G [SEQ ID NO:4] 

TUB-F: 

20 GGT GAC CTG AAC CAC CTC GT [SEQ ID NO:5] 
TUB-R: 

CTT GCG CAG GTC AGA GTT [SEQ ID NO:6] 

TUB and TUB-M are used as template DNA and differ at three residues, bolded above, which 
25 comprise two point mutations and one insertion (bracketed and bolded). TUB-F and TUB-R are 
the "reverse" and "forward" primers used to amplify either TUB or TUB-M in a polymerase 
chain reaction. 



30 
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EXAMPLE 2 
CLEAVAGE REACTION 

The cleavage reactions were carried out using 100 pmol of oligonucleotide, 0.5 units uracil -N- 
5 glycoslyase (Perkin - Elmer) lxPCR buffer (50mM KC1, lOmM Tris-HCl pH 8.3) (Perkin-Elmer) 
in a 250jul reaction. The reaction mixture was incubated at 50°C for 20 minutes to allow 
cleavage of the N-glycosidic bond at uracil. It was then heated for 15 minutes to 105° C to allow 
degradation of the phosphate bonds at the basic sites. The mixture was then purified using anion 
exchange resin to remove buffer salts and other impurities. 

10 

EXAMPLE 3 
SAMPLE PURIFICATION 

Qiagen Anion Exchange Resin was equilibrated in 5mM NH 4 HC0 3 (Sigma) pH 8.4 (sodium 
15 free). 40^1 of the slurry was added to the reaction mixture and the DNA was allowed to bind 
at room temperature for 5 minutes with gentle shaking. The beads were spun down in a bench 
centrifuge and the supernatant discarded. The beads were then washed with 3x100^1 volumes 
of 5mM NH 4 HC0 3 pH 8.4 (sodium free) with incubation and centrifugation between each wash. 
The supernatant was discarded each time. The DNA fragments were then eluted using two 40//1 
20 volumes of 0.5M NH 4 HC0 3 pH 8.0 (sodium free), with incubation and centrifugation as before 
but with the supernatant being kept. The supernatant was then evaporated to dryness on a 
Savant Speedivac and resuspended twice in 20//1 distilled water and evaporated to dryness to 
remove any residual NH 4 HC0 3 . The final product was resuspended in 5^1 distilled water. The 
final concentration being approximately 20pmol///l. 

25 

EXAMPLE 4 

THE POLYMERASE CHAIN REACTIONS AND DNA 
URACIL GLYCOSYLASE REACTION 

30 20 ul reactions were set up containing 2.5mM MgCl 2 , 2.5 mM dATP, dCTP, dGTP, 5 mM 
dUTP, 0.5U Taq Gold (Perkin Elmer), 1.5 mM each TUB-F and TUB-R oligonucleotides and 
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2.4 fg or either TUB or TUB-M or a mix of both. PCR assays were incubated at 95°C for 15 
minutes then cycled at 95°C - 15 seconds, 60°C - 35 seconds, 72°C - 35 seconds for 40 cycles. 
PCR reactions were pooled, each pool contained either 10 or 100 PCR reactions. Uracil DNA 
glycosylase (Perkin Elmer) was added at a ratio of 1U per 10 PCR reactions. Completeness of 
5 digestion was confirmed by agarose gel electrophoresis. 



10 Each DNA glycosylase reaction was loaded onto a C8 aquapore RP300 column equilibrated with 
0. 1M TEAA, the column washed with 0. 1M TEAA at a flow rate of 0.5 mi/min and elute with 
0. 1M TEAA in 60% v/v CH 3 CN. Peaks were collected. Column eluates were desiccated on a 
Savant Speedivac, evaporative centrifuge, resuspended in water to the original volume and 
redessicated. Pellets were resuspended in 5 ml H 2 0. Mass spectrometric samples were prepared 

15 as described in Example 6. 



20 3-Hydroxypicolinic acid is prepared at a concentration of 75mg/ml in 1 : 1 acetonitrile and water 
and stored at room temperature in a closed vial in the dark. A new matrix solution is prepared 
weekly. Cation exchange beads (Bio-Rad, 50W-X4, mesh size 100-200/zm) in ammonium form 
were used to reduce interference from sodium and potassium adducts (Nordhoff et al, 1992). 
Samples were prepared as follows: 0.5^1 matrix, 0.5^1 sample (lOpmol DNA) and 0.5/^1 cation 

25 exchange resin were mixed on the slide and allowed to dry. The beads were then blown off with 
nitrogen gas. Samples were then analysed immediately. 

Samples were run on the Kratos Kompact MALDI 4 with 337nm laser or a Perspective Voyager 
MALDITOF machine. Linear negative mode was used for all spectra. Fifty shots were fired at 
30 power setting 70 to find a sweet spot and then a further 50 shots were Fired at the sweet spot to 
obtain the spectrum. 



EXAMPLE 5 



PURIFICATION OF DIGESTED PCR PRODUCTS 



EXAMPLE 6 



MASS ANALYSIS 
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EXAMPLE 7 
SIMULATION 

In order to assess the ability of this technique to detect mutations, a computer simulation was 
5 designed. Two different stimulations were conducted, one that models a mutation occurring in 
a haploid genome and the other modelling a mutation occurring in a diploid genome on the 
background of a wildtype sequence. 

In order to optimise the detection of mutations, four separate base specific cleavage reactions 
10 have been performed using separated forward and reverse strands and two different base specific 
reagents, in this case, thymidine and cytosine. A random library of exonic sequences has been 
extracted from Genbank. This comprises 100,000 kb of coding sequence concatenated into one 
file. Sequence strings of incremental length are removed from this file. A fingerprint for each 
strand is generated. This is calculated by generating the sets of post cleavage fragments for each 
15 base-specific reagent and sorting the non-redundant fragments. Mutant sequences are created 
by mutating every residue in the wild-type sequence to each of three possible alternatives. The 
fingerprint of each mutant is generated and compared to the wild-type fingerprints. If the 
fingerprints are different, it is recorded as a successful detection and the next mutant examined. 
If the first base specific cleavage reaction is unable to detect the mutation on the forward strand, 
20 the reverse strand is tried and so on until the reverse strand of the second reagent fails. This 
represents the total failure rate under the described conditions. Conceivably one could increase 
the power of the technique by using all four base specific reagents on both strands. 

EXAMPLE 8 

25 DETECTION OF BASE MUTATIONS 

Overlaid tracings from the mass spectrogram are presented in Figure 1 . These show the cleavage 
products of two oligonucleotides 1 and 2 [SEQ ID NO:l and SEQ ID NO:2, respectively], 
which differ at two nucleotides, one producing a fragment with a different nucleotide 
30 composition and the other introducing a new cleavage site. The new fragments resulting from 
these differences are easily separated. This example, observed masses deviate from calculated 
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by ±0.02-1%. This is sufficient to assign the correct base composition in this case, however, it 
is not sufficient to blindly assign base composition peaks from a sample of unknown sequence. 
A study has been done which concluded that all base compositions can be uniquely specified up 
to the 14mer level if one base has a known composition (ie. G=l in the case of the study, or in 
5 our case, T=0) with a measurement of mass to within ±0.01%. This is presently achievable, 
dependent on the mass analyser used and the sample quality and quantity (Pomerantz et al, 
1993). 

Base specific cleavage and mass spectrometry is, therefore, able to differentiate between two 
10 identical length oligonucleotides with different nucleotide compositions and hence is able to 
differentiate between two sequences differing at one base (Table 1). Where a mutation changes 
the residue involved directly in the base specific cleavage reaction (a "U" residue in the case 
presented here), the difference in size of the resultant products is marked (Table 1). The 
accuracy of mass determination allows deduction of the base composition of each fragment and 
15 therefore, where the sequence is known, will enable deduction of the nature of the mutation. 

Table 2 presents stimulation date for the haploid genome case and Table 3 presents the 
stimulation data where a mutation occurs in a diploid organism in the presence of a wild-type 
copy. These data are presented as cumulative "failure to identify" mutations based on both 
20 strands and two base specific cleavage reactions. Therefore, the last column, which is where the 
"C" reaction was unable to pick the mutation on the complementary strand represents the "total 
failure rate" of the technique under these conditions. 

EXAMPLE 9 

25 DETECTION OF POINT MUTATIONS 

The method of the present invention has been employed on PCR products and is able to detect 
point mutations and an insertion in DNA that has been amplified using the polymerase chain 
reaction as discussed below. The PCR templates used, TUB and TUB-M are described in 
30 Example 1 and have three differences, two of which are point mutations and the third is an 
insertion/deletion. All of these differences are visible in the mass spectrograms (Figures 2 and 
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3). Figure 3 represents the reacted, separated products of both TUB-M and TUB. This is a 
reconstruction of a heterozygote. Figure 2 is reacted, separated products of TUB, representing, 
in this case a homozygote normal. Table 4 gives the expected masses for each fragment and the 
corresponding comments on whether they have been seen. All mutations were seen on either 
5 both strands or on one strand only. 
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TABLE 1 



oligolxleavage products 

a CCTCAT' 

b TTTT 

c TTGTAAGAGG 2 



calc. mass 
1810.2 
1318.8 
3190.0 



obs.mass 
1811.1 
1318.4 
3190.9 



oligo2:cleavage products 



a 
b 
c 
d 



CCTCGT 3 
TTTT 
TTGT 
AGAGG 4 



1826.2 
1318.8 
1343.8 
1635.0 



1828.2 
1318.4 
1343.5 
1636.3 



SEQIDNO:18 
SEQIDNO:19 
SEQ ID NO:20 
SEQIDNO:21 
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TABLE 4 



EXPECTED TUB FRAGMENTS 




FRAGMENTS NOT SEEN 


GGC 


1045.6 






CCAC 


1198.8 






CCACA [SEQ ID NO:22] 


1512* 






CCAG 


1318.8 






GGAC 


1358.8 






CCAGCCG [SEQ ID NO:23] 


2226.4 






GCGCAAG [SEQ ID NO:24] 


2210.4 






GCGCAAGA [SEQ ID NO:25] 


2523.6* 






CCGCGAAC [SEQ ID NO:26] 


2539.6 






GGAGCACGCAGG [SEQ ID NO:7] 


3880.4 






CGGCAAGCACCGACAGG [SEQ ID NO:8] 


5374.4 






GGTGACCTGAACCACCTCGTGCG [SEQ ID NO:9] 


5888.8 


PRIMER 




CAGGCGCTTG AG ACTGG ACGCGT [SEQ ID NO: 1 0] 


6258 


PRIMER i 




EXPECTED TUB-M FRAGMENTS 1 








CCAC 


1198.8 


END 




CGAG 


1358.8 






GGAC 


1358.8 






CGAGGC [SEQ ID NO:27] 


1977.2 






LLAULtu [ofc-Q ID NU:Z5J 


2226.4 






GCGCAAG [SEQ ID NO:29] 


2210.4 






CGACAGCC [SEQ ID NO:30] 


2539.6 






CCGCGAAC [SEQ ID NO:31] 


2539.6 






CGAACGGC [SEQ ID NO:32] 


2579.6 






GGAGCACGCAGG [SEQ ID NO: 1 1] 


3880.4 






GGTGACCTGAACCACCTCGTGCG [SEQ ID NO:12] 


5888.8 


PRIMER 




CAGGCGCTTGAGACTGGACGCGT [SEQ ID NO:13] 


6258 


PRIMER 





Fragments obtained due to the terminal transferase activity of Taq polymerase which 
results in the addition of a dATP at the 3' end of the PCR product. 
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EXAMPLE 10 
MODIFICATION DETECTION PROTOCOL 

The method of Example 8 is employed except DNA polymerase enzymes are employed with the 
ability to incoiporate both dNTPs and rNTPs. Specific cleavage reactions are performed on PCR 
products in which one of the nucleotides is substituted for rNTP. This permits the base specific 
cleavage reactions to be conducted in alkali at high temperature. 

EXAMPLE 11 
IDENTIFICATION OF MUTATION POSITION 

The method of Example 8 employs Uracil-N-glycososylase which cleaves DNA at uracil. It is, 
therefore, a T reaction as uracil is replacing thymidine in the PCR product. In this Example, 
cleavage occurs at each of other bases so as to create sets of overlapping data to give 
information about the position of the mutation. 

EXAMPLE 12 
DETERMINATION OF NUCLEOTIDE SEQUENCE 

The method of the present invention is used to determine a nucleotide sequence of a nucleic acid 
fragment. The method employed is substantially as described in Example 8. 

EXAMPLE 13 

DETECTION OF PREVIOUSLY UNKNOWN MUTATIONS 

The method of the present invention is further demonstrated on a sequence polymorphism in the 
EL-12 gene. This previously unreported sequence change results in a TaqI RFLP and, therefore, 
can be followed by enzymatic digestion of PCR products. 
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Methods 



Template DNA was genomic DNA from human volunteers of each possible genotype of the IL- 
12 polymorphism (ie. +/+, +/-, and -/-, where + is the presence of the Taq restriction site). PCRs 
were carried out in 20jj1 reactions in 192 well plates in a Corbett Thermocycler with the 
following reaction mixture: 50mM KC1, lOmM Tris-HCl pH 8.3, 25mM MgCl 2 2.5mM dATP, 
dCTP and dGTP (Promega), 5mM dUTP (Boehringer Mannheim GmbH), 0.5U AmpliTaq Gold 
(Perkin Elmer), 0.4pM primers (Bresatec). After an initial 15min incubation at 95°C, the 
reactions were cycled 95°C 15 sees, 58°C 35sec, 72°C 35sec, for 40 cycles. 7 reactions were 
pooled for the homozygotes and 9 for the heterozygote. 1 unit of AmpErase Uracil-N- 
glycosylase (Perkin Elmer) was added to each pool and the reaction incubated at 50°C for 1 
hour, followed by 30 minutes at 105°C. The extend of completion of the cleavage reaction was 
monitored by the absence of a band on an agarose gel. The cleavage reaction was monitored by 
the absence of a band on an agarose gel. The cleavage products were purified using reverse 
phase HPLC on a 100x2. 1mm C8 aquapore RP300 column (Applied Biosystems). The flow rate 
was 0.5ml/min and absorbance was monitored at 254nm. The sample washed with 0.1M 
triethylaminoacetate (TEAA) and eluted in 0.1M TEAA/60% w/v acetonitrile and the fraction 
with absorbance at 254nm was collected and evaporated to dryness using a Savant Speedivac. 
The residue was resuspended in 100|il distilled deionised water and evaporated to dryness and 
then resuspended in 1 jal water. 0.5^1 of this was mixed with 0.5|il 3-hydroxypicolinic acid 
(saturated solution in 50% w/v acetonitrile and 0.5^1 NH 4 + ion-exchange beads (BioRad, 50W- 
X4, mesh size 100-200jim) on a sample slide. The mass spectrometer used to characterise the 
reaction products was a Voyager BioSpectrometry Workstation from PerSeptive Biosystems. 
128 laser pulses at power 1800 were averaged. Post Source Decay spectra were collected using 
a Kratos Kompact MALDI4 TOF mass spectrometer with 377nm laser and a curved field 
reflector in positive ion mode. Matrix and sample preparation as above. After scanning in linear 
mode for the sweet spot, the ion gate was set 34.8 Da above and 36.2 Da below the parent ion 
at 1727.2 Da. 200 profiles at 5 shots per profile were averaged. Spectra were corrected for the 
curved field. 

Genotypes were confirmed by demonstrating the presence or absence of the TaqI restriction site 
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by digesting PCR products with TaqI restriction enzyme (Gibco-BRL) and analysing the 
products by agarose electrophoresis. DNA bands were stained with ethidium bromide. 

A computer simulation of the method has been written and lOOkb of random coding sequence 
from Genbank has been fed into it. The program takes discrete-length bites of sequence from 
a file of concatenated cDNA sequence from Genbank. Each base is mutated to each hypothetical 
variant of the original sequence by removing the cleaved base leaving the residual short strings. 
The mass spectrometry was modelled, fragments of different nucleotide composition being 
distinguishable and those of identical composition being indistinguishable. As quantitation is 
difficult on the MALDI, changes in peak height was not used as an indication of a change in 
underlying sequence. The program then compares "spectra" and tallies the number of mutations 
that were missed. The program can model the detection of a mutation in the presence of a wild- 
type sequence (heterozygote) or can model the differences between two homozygotes. In the 
first case a mutation can only be detected by the presence of a new peak and in the latter case, 
as well as the presence of a new peak, the disappearance of a peak can also signal a change. All 
four base specific cleavage reactions were used and reactions were performed on separated 
strands giving a total of 8 reactions per PCR product. Also the model has been refined to take 
account of the ability of post source decay (PSD) to identify changes in peaks containing a 
complex mix of oligonucleotides. In this case fragments of different sequence are 
distinguishable. 



A PCR assay was designed to incorporate the mutated region and then subjected to uracil -N 
glycosylase treatment. The products were purified and analysed by MALDI-TOF mass 
spectrometry. The sequence of the PCR primers and product along with the mutation are shown 
in Figure 4. The C to T change gives rise to a Taq RFLP and this can be seen in homozygote 
and heterozygote state in Figure 5. The spectra generated by the MALDI-TOF can also be seen 
in Figure 5. The expected and observed masses of the cleavage products from the two alleles 
are given in Table 5. The position of the mutation and deduction of the changed base is evident 
from study of this Table. 



Results 



WO 98/54571 PCT/AU98/00396 



-29- 



A limitation to the sensitivity of this method results from the lack of quantitative data available 
from the MALDI. When the fragment derived from the mutated sequence coincides with other 
fragments of identical nucleotide composition in the wild-type sequence, its disappearance will 
go undetected. Similarly, the appearance of a new fragment in the mutated sequence will go 
unnoticed of it has identical nucleotide composition to one or more other cleavage products. If 
both these conditions exist for all cleavage reactions, then the mutation will be missed. This 
technique, therefore, is not as advantageous for longer fragment as for small fragments. 

To address this problem, the inventors employed a second dimension detection protocol on the 
MALDI-TOF machine. Post source decay (PSD) uses the dissociation of the highly energised 
ions during their flight to the detector as this second dimension. They are directed into an 
electric field of opposite polarity and are reflected. The smaller ions are reflected earlier and 
reach the detector first. As the spectrum from the decay is dependent on the sequence of the 
oligonucleotide (and not the nucleotide composition), the aforementioned limitation is bypassed, 
generating a method of mutation detection that is now extremely sensitive. 

The utility of MALDI-TOF analysis with PSD is demonstrated in Figure 6 where two 
oligonucleotides of identical nucleotide composition are separated by MALDI-TOF using PSD. 
The resulting spectra are quite distinguishable. Sequence determination of small oligonucleotides 
is feasible using molecular dissociation methods and, therefore, the subject method extrapolates 
into an accurate resequencing protocol. 

A computer simulation of data from the linear separation of cleavage products has been written. 
Using Genbank data, the expected number of base substitution that would be identified when 
comparing two homozygotes over a 250bp PCR distance is 98.5%. the comparable figure is 
95% when a homozygote is compared to a heterozygote. If each mass peak from a base specific 
cleavage is analysed using a secondary dissociation technique, eg. PSD on the MALDI-TOF 
machine, then sensitivity of mutation detection improves dramatically. This has also been 
simulated and for a lOOObp fragment subjected to base specific cleavage, and analysed with PSD, 
99% of all substitutions will be detected for a homozygote to heterozygote comparison and 
99.8% when two homozygotes are compared. 
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Those skilled in the art will appreciate that the invention described herein is susceptible to 
variations and modifications other than those specifically described. It is to be understood that 
the invention includes all such variations and modifications. The invention also includes all of 
the steps, features, compositions and compounds referred to or indicated in this specification, 
individually or collectively, and any and all combinations of any two or more of said steps or 
features. 
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SEQUENCE LISTING 
(1) GENERAL INFORMATION: 

(i) APPLICANT: (Other than US): THE WALTER AND ELIZA HALL INSTITUTE OF 

MEDICAL RESEARCH 

(US only): FOOTE Simon, ELSO Colleen, SIMPSON Richard, REID 
Gavin 

(ii) TITLE OF INVENTION: A NOVEL METHOD 

(iii) NUMBER OF SEQUENCES : 32 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: DAVIES COLLISON CAVE 

(B) STREET: 1 LITTLE COLLINS STREET 

(C) CITY: MELBOURNE 

(D) STATE: VICTORIA 

(E) COUNTRY: AUSTRALIA 

(F) ZIP: 3000 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT INTERNATIONAL 

(B) FILING DATE: 28 MAY 1998 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA 

(A) APPLICATION NUMBER: PO 7102 

(B) FILING DATE: 28 MAY 1997 

(A) APPLICATION NUMBER: PO 7109 

(B) FILING DATE: 30 MAY 1997 

(A) APPLICATION NUMBER: PP 1665 
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(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 
CCTCATUTTT TUTTG TAAG A GG 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 



CCTCGTUTTT TUTTGTUAGA GG 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

GGTGACCTGA ACCACCTCGT GCGTCCAGCC GTTCGTGGCT GTCCAGTCCG 
CAAACTCTGA CCTGCGCAAG 
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(2) INFORMATION FOR SEQ ID NO: 4: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

GGTGACC TG A ACCACCTCGTG CGTCCAGCCG TTCGAGGCTG TCGAGTCCGC 50 
(O)AACTCTGAC CTGCGCAAG 69 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 
GGTGACCTGA ACCACCTCGT 2 0 



(2) INFORMATION FOR SEQ ID NO : 6 : 

{ i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 



CTTGCGCAGG TCAGAGTT 



18 
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(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 
GGAGCACGCAG G 

(2) INFORMATION FOR SEQ ID NO : 8 : 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 
CGGCAAGCAC CGACAGG 

(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
GGTGACCTGA ACCACCTCGT GCG 
(2) INFORMATION FOR SEQ ID NO: 10: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CAGGCGCTTG AGAC TGGACG CGT 2 3 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GGAGCACGCA GG 12 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
GGTGACCTGA ACCACCTCGT GCG 23 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 3 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 



CAGGCGCTTG AG ACTGG AC G CGT 



23 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 124 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

CACAACGGAA TAGACCCAAA AAGAUAAUUU CUAUCUGAUU UGCUUUAAAA 50 
CGUUUUUUUA GGAUCACAAU GAUAUCUUUG CUGUAUUUGU AUAGUUCGAU 5 0 

GCUAAAUGCU CAUUGAAACA AUCA 2 4 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 124 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15: 

GUGUUGCCUU AUCUGGGUUU UUCUAUUAAA GAUAGACUAA ACGAAAUUUU 50 
GCAAAAAAAU CCUAGUGUUA CUAUAGAAAC GACAUAAACA UAUCAAGCUA 50 
CGATTTACGA GTAACTTTGT TAGT 2 4 

(2) INFORMATION FOR SEQ ID NO: 16: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 
CATCCT 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 
CACCTT 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 
CCTCAT 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19 : 
TTGTAAGAGG 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : S ingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
CCTCGT 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: -DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
AGAGG 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
CCACA 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
CCAGCCG 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
GCGCAAG 

(2) INFORMATION FOR SEQ ID NO : 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 5 
GCGCAAGA 
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(2) INFORMATION FOR SEQ ID NO : 26: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 8 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 26: 
CCGCGAAC 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
CGAGGC 

(2) INFORMATION FOR SEQ ID NO : 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
CCAGCCG 

(2) INFORMATION FOR SEQ ID NO : 29: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 7 base pairs 
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(D) 



(B) 



(C) 



TYPE: nucleic acid 
STRANDEDNESS : single 
TOPOLOGY: linear 



(ii) 



MOLECULE TYPE: DNA 



(xi) 



SEQUENCE DESCRIPTION: SEQ ID NO: 29: 



GCGCAAG 



7 



(2) INFORMATION FOR SEQ ID NO : 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
CGACAGCC 8 
(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
CCGCGAAC 8 
(2) INFORMATION FOR SEQ ID NO : 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
CGAACGGC 
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CLAIMS: 

1 . A method of detecting a difference of one or more nucleotides between a nucleic acid 
molecule to be tested and a reference nucleic acid molecule, said method comprising subjecting 
the test nucleic acid molecule to base specific cleavage to generate oligonucleotide fragments, 
separating the resulting oligonucleotide fragments based on mass by M ALDI-TOF MS and/or 
other equivalent procedure to produce a fingerprint of the oligonucleotide fragments comprising 
one or more peaks wherein a peak represents the mass of each fragment and identifying an 
altered peak relative to a reference nucleic acid molecule subjected to the same procedure 
wherein the presence of an altered peak is indicative of a difference of one or more nucleotides 
in said tested nucleic acid molecule. 

2. A method according to claim 1 wherein the nucleic acid molecule to be tested is amplified 
by a polymerase chain reaction (PCR) prior to base specific cleavage. 

3. A method according to claim 1 or 2 wherein the base specific cleavage results in 
oligonucleotide fragments of from about 2 bases to about 1000 bases. 

4. A method according to claim 3 wherein the base specific cleavage results in 
oligonucleotide fragments of from about 3 bases to about 500 bases. 

5. A method according to claim 4 wherein the base specific cleavage results in 
oligonucleotide fragments of from about 4 bases to about 100 bases. 

6. A method according to any one of claims 1 to 5 wherein the base specific cleavage is 
uracil specific cleavage. 

7. A method according to claim 6 wherein the uracil specific cleavage is mediated by uracil- 
N-glycosylase. 

8. A method according to any one of claims 1 to 7 further comprising subjecting 
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fragmentation products to further separation (PSD) to generate a spectrum from decay 
dependent on the nucleotide sequence of the oligonucleotide. 

9. A method according to claim 8 wherein the further separation of fragmentation products 
is by post source decay (PSD). 

10. A computer programme capable of controlling a method of detecting a difference of one 
or more nucleotides between a nucleic acid molecule to be tested and a reference nucleic acid 
molecule, said method comprising subjecting the test nucleic acid molecule to base specific 
cleavage to generate oligonucleotide fragments, separating the resulting oligonucleotide 
fragments based on mass by MALDI-TOF MS and/or other equivalent procedure to produce a 
fingerprint of the oligonucleotide fragments comprising one or more peaks wherein a peak 
represents the mass of each fragment and identifying an altered peak relative to a reference 
nucleic acid molecule subjected to the same procedure wherein the presence of an altered peak 
is indicative of a difference of one or more nucleotides in said tested nucleic acid molecule. 

11. A method according to claim 9 wherein the nucleic acid to be tested is amplified by PCR 
prior to base specific cleavage. 

12. A method according to claim 9 or 10 wherein the base specific cleavage results in 
oligonucleotide fragments of from about 2 bases to about 1000 bases. 

13. A method according to claim 9 wherein the base specific cleavage results in 
oligonucleotide fragments of from about 3 bases to about 500 bases. 

14. A method according to claim 10 wherein the base specific cleavage results in 
oligonucleotide fragments of from about 4 bases to about 100 bases. 

15. A method according to any one of claims 9 to 13 wherein the base specific cleavage is 
uracil specific cleavage. 
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16. A method according to claim 14 wherein the uracil specific cleavage is mediated by 
uracil-N-glycosylase. 

17. A method according to any one of claims 10 to 16 further comprising the further 
separation of fragmentation products to generate a spectrum from decay dependent on the 
nucleotide sequence of the oligonucleotide. 

18. A method according to claim 17 wherein the further separation of fragmentation products 
is by post source decay (PSD). 

19. An apparatus capable of detecting a difference of one or more nucleotides between a 
nucleic acid molecule to be tested and a reference nucleic acid molecule, said apparatus 
comprising means of subjecting the test nucleic acid molecule to base specific cleavage to 
generate oligonucleotide fragments, separating the resulting oligonucleotide fragments based on 
mass by MALDI-TOF MS and/or other equivalent procedure to produce a fingerprint of the 
oligonucleotide fragments comprising one or more peaks wherein a peak represents the mass of 
each fragment and identifying an altered peak relative to a reference nucleic acid molecule 
subjected to the same procedure wherein the presence of an altered peak is indicative of a 
difference of one or more nucleotides in said tested nucleic acid molecule. 



20. An apparatus according to claim 19 further comprising further fragmentation separation 
means to generate a spectrum from decay dependent on the nucleotide sequence of the 
oligonucleotide. 

21 . An apparatus according to claim 20 wherein the further fragmentation separation means 
is post source decay (PSD). 

22. Use of MALDI-TOF in the detection of a difference of one or more nucleotides between 
a nucleic acid molecule to be tested and a reference nucleic acid molecule. 



Use according to claim 22 further comprising use of PSD to generate a spectrum for 



WO 98/54571 



PCT/AU98/00396 



-48- 

decay dependent on the sequence of an oligonucleotide. 

24. A method for identifying and/or locating a mutation in one or more bases in a target 
nucleic acid molecule, subjecting the test nucleic acid molecule to base specific cleavage to 
generate oligonucleotide fragments, separating the resulting oligonucleotide fragments based on 
mass by MALDI-TOF MS and/or other equivalent procedure to produce a fingerprint of the 
oligonucleotide fragments comprising one or more peaks wherein a peak represents the mass of 
each fragment and identifying an altered peak relative to a reference nucleic acid molecule 
subjected to the same procedure wherein the presence of an altered peak is indicative of a 
difference of one or more nucleotides in said tested nucleic acid molecule. 

25. A method according to claim 24 wherein the nucleic acid molecule to be tested is 
amplified by a polymerase chain reaction (PCR) prior to base specific cleavage. 

26. A method according to claim 24 or 25 wherein the base specific cleavage results in 
oligonucleotide fragments of from about 2 bases to about 1000 bases. 

27. A method according to claim 26 wherein the base specific cleavage results in 
oligonucleotide fragments of from about 3 bases to about 500 bases. 

28. A method according to claim 27 wherein the base specific cleavage results in 
oligonucleotide fragments of from about 4 bases to about 100 bases. 

29. A method according to any one of claims 24 to 28 wherein the base specific cleavage is 
uracil specific cleavage. 

30. A method according to claim 29 wherein the uracil specific cleavage is mediated by 
uracil-N-glycosylase. 

31. A method according to any one of claims 24 to 30 further comprising subjecting 
fragmentation products to further separation (PSD) to generate a spectrum from decay 
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dependent on the nucleotide sequence of the oligonucleotide. 

32. A method according to claim 3 1 wherein the further separation of fragmentation products 
is by post source decay (PSD). 
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